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ABSTRACT 


This thesis seeks to use probability models to investigate the effects and value of 
battle damage assessment (BDA) information availability on sequential tasks encountered 
in the defense against missile attacks. Different levels of information will have different 
impacts on the outcome of the battle. Additional information could increase the 
effectiveness of the defensive weapon system. On the other hand, the enemy could use 
deception techniques, electronic warfare (EW) and Decoy measures on the information- 
gathering methods to disrupt the acquisition of information which would decrease the 
effectiveness of defensive weapons. In the models, we show how to best allocate limited 
resources; 1.e. the available kill time, to maximize the reward. We define a measure of 
effectiveness (MOE) for information which can be used for evaluating information value 
and decision making. We discuss different strategic alternatives and information value for 


both defenders and attackers in electronic warfare (EW). 
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EXECUTIVE SUMMARY 


In recent years, many countries have acquired short range ballistic missiles. As a 
result, many other countries have become possible targets for these missiles. Even when 
these missiles carry conventional warheads they levy a psychological and political penalty 
on the defending nation. Naturally, the defending nations are seeking effective ways to 
repulse ballistic missile attacks. 

Defense, for most of the nations who believe they may be subject to a ballistic 
missile attack, is in the terminal stage of the ballistic missile trajectory; that is, when the 
missile reenters the atmosphere in a path towards its intended target. This thesis explores 
probability models to investigate the effects and value of information available to perform 
sequential tasks in the defense against a ballistic missile attack. 

The information available for these sequential tasks encountered in a ballistic 
missile defense is termed battle damage assessment (BDA). This thesis investigates 
different levels of information for the battle damage assessment and the varying impacts 
this will have on the outcome of the battle. It is assumed that additional information could 
increase the effectiveness of the defense’s weapons systems. Scenarios are explored where 
the enemy uses electronic warfare and decoys to disrupt the defender’s acquisition of 
information. The intent of these models is to produce results for allocating limited 
resources and maximizing the available kill time and the probability of destroying a real 
missile rather than an electronic image or a decoy. The models are discussed from both the 
view point of the defender and the attacker. 

First, the author presents a generic scenario which will be used throughout the rest 
of the study to analyze probabilities for both defender and attacker. Each scenario is 
further categorized by an investigation with battle damage assessment information and a 
scenario in which there is no battle damage information. Next, a measure of effectiveness 


is developed for the investigation. Finally, the author presents an investigation of the effect 


of countermeasures on all the scenarios developed earlier in the study and a more detailed 
presentation of the effects of electronic warfare on the defensive systems. 

The implication of the study are that theater missile defense would require 
enormous investments in research and the resources of the defending country. If the 
defender pursues a theater missile defense it must be done very carefully. When a 
successful attack takes place, even the crudest of ballistic missile might have a 
psychological effect out of proportion to the military value of the weapon. For example, in 
the Gulf War, the allies expended enormous resources to destroy missiles which when 
launched, rarely found a target and in any case, were not aimed at any militarily valuable 
target. These weapons had political effects well beyond their real military value. 

Throughout the study, it is apparent that theater ballistic missile defense is closely 
tied to the battle damage assessment information system. Without the development of 
these systems it will be impossible for the defender to allocate the resources of kill time 
and weapons to achieve the best outcome for the battle. 

The effect of electronic warfare and different levels of deception on the battle 
damage assessment information can have important effects and should be carefully 
investigated to ensure that the defender receives the benefit of more information. An 
improved kill rate on the first target in the scenarios presented, increases the value of 
battle damage assessment. 

Suggested follow-on research is the problem of target identification to determine 
whether the target is a missile or a decoy. With reliable target identification information, 
the problems presented in this thesis would be reduced to improvements in weapons 
efficiency, and the scenarios themselves would consist of which target to identify first 
rather than which target is the most probable warhead. 

This thesis presents a very simple scenario. However, the probability models 
identify some real world problems. The author found that the information provided by a 


battle damage assessment greatly affects the effectiveness of any defense scenario. 


Sometimes this effect is unexpected, and this subject should be rigorously studied before 


any such system ts actually built. 


XI 





I. INTRODUCTION 


A. BACKGROUND 


Technological advances have made it possible for many countries to acquire 
ballistic missiles. As a result of the advances in technology and the willingness of some 
countries to sell that technology, the likelihood of ballistic missile attack is increasing. 
Ballistic missiles can deliver conventional and unconventional weapons over extended 
ranges. Even when a ballistic missile carries a conventional warhead it can still cause 
significant damage to the capability of the defender. From the experience of The Gulf War, 
we know that ballistic missile attacks can cause problems not only to the infrastructure and 
to human lives but can also create psychological and political problems in that a nation 
lives in constant fear of attack. The nation’s economy will suffer and the nation’s internal 
cohesiveness may decrease. 

Many countries are seeking an effective way to defend themselves against ballistic 
missile attacks. For example, the United States started the Ballistic Missile Defense 
Program (BMD) in April 1984 [Ref. 1]. The current BMD program contains three major 
parts: 

1. Theater Missile Defense 

Theater Missile Defense (TMD) program is intended to provide highly effective 
TMDs to forward deployed forces and to U.S. Forces and allies. The TMD defense can 
take many forms. The Joint Chief of Staff categorizes TMD mission needs as, first, a 
passive defense to enhance the survivability of friendly forces and assets; second, a battle 
Management/Command, Control and Communications and Intelligence system (BM/C31) 
to provide effective communications, command and control of the TMD operation and to 
ensure data flow, third, an attack operations ( Counter-force) for the destruction of the 
enemy’s capability to launch missiles; and finally an active defense to intercept the Theater 
Ballistic Missile (TBM) in flight so as to either destroy the TBM or negate the effects of 


the warhead. 


pe U.S. National Missile Defense 

National Missile Defense is a research and development project for the 
development of ground based-defenses to protect the U.S. from a limited ballistic missile 
attack. The Army’s National Missile Defense (NMD) system will operate with external 
Early Warning (EW) sensors (Space and Missile Tracking System, DSP and EWR) and 
the United States Space Command’s (USSPACECOM) Command and Control Center via 
a Command-Level Battle Management Command Control and Communications (BMC3) 
network. The Army configuration of the proposed NMD system includes ground-based 
exo-atmospheric hit-to-kill interceptors, a ground-based phased array and national defense 
radar (for surveillance, tracking, object classification and kill assessment) and Battalion 
BMC3 (Bn BMC3) (for human-in-control, engagement planning, top level decision 
making and system communications) [Ref. 2]. 

3: Follow-On Research 

Follow-on research supports more advanced BMD technologies. For example, to 
increase the effectiveness of the weapon system, issues related to the flow and utilization 
of information play an important role in the BMD program. The BMD program represents 
an investment of $34,683.3 million dollars [Ref. 1]. These resources can be used to 
improve the effectiveness of weapon systems or improve the acquisition flow and 
utilization of information. However, how do we evaluate the value of information? What 
are the trade-offs of investing money on weapon or information? How can we effectively 


use information to get the best results? 
B. ACTIVE DEFENSE 


Considering the trajectory of a ballistic missile, the active defense can be divided 
into three opportunity stages: 

I. The Boost Stage 

The boost stage is the early portion of missile flight. In this stage the missile 
engine will burn to produce thrust until it reaches terminal velocity. There are advantages 


to engaging in this phase. The first is that the missile ts in its early flight and cannot expel 


to 


its multiple warheads and decoys. The second is that the missile is probably still above the 
attackers’ territory and will not cause any damage to the defender. 

Because of short time and longer distances represented by this stage, the stage 
requires a relatively higher technology to intercept a missile. We may use space-based 
sensors and weapons (“Brilliant Eyes/Pebbles’”) to accomplish the task. For the reason of 
defense budget resources, this method would probably not be used in the post Cold-War 
world. 

ae The Midcourse Stage 

In this stage the missile follows a ballistic path. It is desirable to intercept in this 
stage because of the advantage of destroying the missile outside the defender territory or 
at very high altitude. At this stage it is less difficult to detect a target and to guide an anti- 
ballistic missile weapon than in the boost stage. Additionally, the missile is moving 
relatively slower than in the terminal stage. However, the enemy may use decoys or other 
countermeasures to make the defenders’ task more difficult. 

3. The Terminal Stage 

In this stage the missile re-enters the atmosphere and follows a downward path to 
its target area. The defender at this stage has more resources to destroy the missile. For 
example, the use of air-defense missiles and aircraft. The problem at this stage is that the 
missile is moving at a very high speed and offers only a small window for interception. 
Additionally, destruction of the missile will result in scattering debris that may still result in 
the missile being an effective political weapon even though a less then effective military 
one. 

4. Summary 

The stages of the ballistic missile trajectory can be thought of as three unique 


Opportunities or time windows for the defense to engage the missile attack. 
c, THE ROLE OF INFORMATION 


Information is power in modern warfare and modern business. An effective system 


must combine information technology with other resources to achieve the best results. The 


Ballistic Missile Defense needs an extremely accurate and fast strike capability which can 
only be accomplished with a high speed command, control, communication, intelligence 
(C31) and battle awareness information system. Information can be obtained from external 
sources, such as satellites, early-warning air radar, ground-based radar and other sensor 
systems. The system may produce information on target identification, detection, 
tracking, and project battle damage. Different levels of information will have different 
impacts on the outcome of the battle. Additional information could increase the 
effectiveness of the defensive weapon system. On the other hand, the enemy could use 
deception techniques on the information-gathering (EW and Decoy measures) methods to 
disrupt the acquisition of information which would decrease the effectiveness of defensive 
weapons. 

This thesis seeks to use probability models to investigate the effects and value of 


information availability on sequential tasks encountered in defense against missile attacks. 


If. MODEL DESCRIPTION 


A. THE GENERIC SCENARIO 


This thesis is mainly based on study of a generic scenario and the assumptions in 
the following paragraphs. These assumptions restrict the problem to two tasks and the 
decision as to which task to pursue to obtain maximum combat effectiveness. The tasks 
are performed by a weapon system (server). The weapon system (server) contains a 
sensor-C3 sub-system and a ballistic missile interceptor. It 1s confronted with the 
opportunity to address particular and temporarily available tasks; that is to destroy 
incoming targets. Each task requires an acquisition time T, to acquire enough detection 
and identification information for an incoming missile. While the tracking information is 
available the interceptor can start to deliver a lethal kill. After a lethal kill time T, the 
target will be destroyed. 

Because of the ballistic missile trajectory there is only a limited period of time 
(window) to engage target /, 1.e. accomplish Task 7. Let's assume that the server starts to 
work on Task | from time t;; = 0 and the time available to complete Task | extends to ti2 
> ty, The opportunity to work on Task 2 begins at t21, where tz; => 0, resulting in an 


overlapped region between the two tasks of t)2 - t2;. (See Figure 2-1); the opportunity to 





Figure 2-1. Overlapping Tasks 


work on Task 2 ends at time t22 

More than two tasks could be overlapping in a real world problem, but for the 
purposes of this study only two tasks will be considered. Assume that at any moment the 
time needed to complete service on Task 7 is T;= T, + Ty. For a glimpse-type sensor the 
acquisition time T, will be modeled as an exponential random variable which has mean 
|/A,. Depending on the characteristics of the weapon, we may choose different 
distributions for the lethal kill time T;,. For convenience, we assume that the time to 
complete task / has an exponential distribution with mean 1/ A,. Since 1/A, is the expected 
survival time of target 7, the parameter 4, can be thought of as the relative efficiency 
factor of an anti-ballistic missile weapon. A weapon with a higher A, is a more efficient 
weapon, because within a limited time interval the chance to complete a task is higher. 

Assume that at any moment in time there is a chance to kill a target that is 
independent of events that have preceeded that point (t, t + dt) in time (= A, dt, where dt is 
a small time, although the completion rate A; could depend on time and distance: A, (t ,d) 

We will allow the server, shooter, or sensor to “attack” just one task at a time. However, 
under some conditions one could co-allocate “shots” across several tasks or targets. A 
BMD system will assign different values to different targets. If we finish a task, we will be 
rewarded with the task’s value. On the other hand if we do not finish then the task will not 
have any value. The task value relates to the damage the missile might cause to the 
defender in case it is not intercepted. 

The issue of primary interest in this paper is how to best allocate limited server 
capacity to the two tasks in the light of the available information. Ultimately, this will 
determine how much effort to expend for an increase in opportunity time. This will also 
lower service (reaction/response/kill) times of the server and reduce the effect of target 
duping and other attacker tactics. 

The following discussion of models reflects varying levels of information that 


might be available to the defender or server. It is assumed that the server has knowledge 


6 


of the various times t2), t)2, t22, sufficiently in advance to allow the server to accommodate 


the decision making process. 


B. MODEL WITH PERFECT ACQUISITION INFORMATION BUT 
WITHOUT BATTLE DAMAGE ASSESSMENT (BDA) INFORMATION 


In this model we will assume that the BMD system has perfect target acquisition 
information, including detection, identification, and localization information. So there is no 
acquisition time model required. The defense system knows when Task / is available but 
does not know when it is or will be completed. Since there is no-BDA information the 
system will use a threshold policy to engage a target. The rule of engagement is first- 
come, first-served. The system starts immediately with Task | and pursues service of Task 
| for threshold time tT )2( here to) < T12 < ti2). Then the system switches to Task 2 after T j2 
and continues to t22. If we finish Task 1 we will have received the value for the first target, 
V,. If the system completes Task 2 in time the result 1s value V2. Assume the task values 
are additive. Realistically, in a real world situation, the values need not add; including the 
defended target’s value accounting for the effect of a second strike on the same target may 


be smaller in terms of infrastructure or more in terms of personnel. 


r Expected Reward 

We will assess the reward of a policy by the expected sum of the values of the two 
tasks accomplished. Let V(z,,) be the expected reward from following the above ru/e, that 
is, if the system switches targets between end of the available time for Task 1, and the 


beginning of the available time for Task 2: 
V(r.) =V[1-e** | +9, fi-e#'=)] 
-A,r, -Ay(ty-2)] (1) 
=v, +V, -|Ve Ann tee | Mn SS Tae 


cf. Gaver and Jacobs (1996) [Ref. 7]. 


The reward if the system switches tasks at beginning of the available time for Task 2 (7,, = 


t,) 


(0) = ma =a + (1 = aac iSe 


: ‘ ; =A, (ia =t5) (2) 
=r -[re* ae Venta | 
The reward if the system switches at the end of the available time for Task I (z,, = ¢,,) 
M(4))=M(l-e ) +1 (1-2 at) 
(3) 


ia , , =e a y =, ara atys 
ae -|Ve Ata 4 Ye Alte = 


In fact, it is convenient to study the decrement to achieving the maximum reward 


V) + 1 ona fractional basis; this is 


Vit 
D(1,; ) a om -- . 
oe (4) 
= E e042 der, Me | oy See 
Fee r = V/V, +V,), 5 =¥,/V, 4%). 
We want to pick 72 = Zopt that minimizes this decrement. Now 
O Dr 
( ) Ee |- ra Put ine ia 
OD (5) 
OUD Ge 
H *) _ rAi etn Sara glee > 0. 
Oe : 


This tells us that D(z,,) is bow/-shaped, having just one bottom as shown in Figure 
2-2. One of the above pictures must describe the situation; which one depends on the 
various parameters. In case (I), the unrestricted minimum of D(z,,) occurs before /,,, the 
time when Task 2 appears. A change to Task 2 at this time cannot be sensible under the 
current model because Task 2 is not available; the best feasible time must be at /, for this 
case. If case (II) prevails, then there is a bona-fide time between t,, and f,, to change to 
Task 2. If case (III) holds, then it is best to continue with Task | to the end (“a bird in 


hand...”?) and then switch over to Task 2. Note that only one switch 1s allowed. 


(I) (II) (III) 


£17 (I) r19(I) > (IID) 
E>) £12 


Figure 2-2. Bowl-Shaped 


The above can be formalized by finding 7,,, the global/unrestricted minimizing 


value of changeover time threshold by minimizing (finding bowl-bottom coordinate) of 


LD 7/2). Then 





Topt = 24 eae, CASE 1 
= i ift,,s ite Sh, CASE i (6) 
Sih eae CASE III 
ae D 
To find a formula for 7,, differentiate D(z,,) and solve aa OF 
12 
OD + , , 
(V, a yon) a) = _ eV (an) = VA, e172 +V,A, falter) 
OT) OT, 
PD r,) V(x) an ) 
Vet. aos = _ ae = +V. 2 7AM un 4 42 po fot) sy 0. 
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So, equating the first derivative to zero and solving gives: 





(3) 


= of A 9 
i) = er) Nea "ey ed (9) 


Case (I) holds if r,, <¢,,, in which case opt = 4. 











Case (II) holds if ¢,, < ti < ¢,,; then Topt = 712. 
Case (IH) holds if t2 > 4,,; then topt = ¢,,; 
cf. Gaver and Jacobs (1996) [Ref. 7]. 


For Case (I) where ti2 must be less then 4, in the solution to Equation 9 we have 


/ sy 
= In a Hot > = [> 
er eae Te ees etn 











ras) 

In SO A fay aes 10 

Vag) (7A * Mal Aale (10) 
V = eM Sy artes - Azty 


A} 


then T,2 will less than ta), in which case the server will choose T opr = t21. 


Case(II) if to, < T12 < tiz; then Cop = S12 
ArV > re : 
Case(III) 1fV’; = mee "22 then 12 > ti2, in which case Top: = t12} 
I 


From Equation 9 we know that Top is governed by In ( ViA1/ V2A2). If Vidi < V2A2 then 
In ( ViA; / V2A2) is negative. Notice that V,;A, < V2A2 is equivalent to V, /(1/A;) = 
V/E(T,) < V2/E(T2) = (V, /A2). This means when the expected reward value gained from 
killing Task | per unit time is less than Task 2; the system will allocate less kill time for 


Task | in order to maximize the total expected reward value. Under this situation the best 


Strategy for the server to get more expected return value is to switch early to work on 
Task 2. On the other hand if the expected return value per unit time of Task | is more 
than Task 2, then the server should spend more time on Task |. This property indicates 
that a system will spend more time on a high value and high kill rate target, which is 
intuitive. 

For example, suppose two tasks arrive simultaneously and the available 
engagement time is one unit for both tasks; the Task 2 has a fixed value 5; A, and 2 each 
vary from 0.05 to 5. Figures 2-3a to 2-3d show that if the Task | is more important 
(higher value) than Task 2, then the system will assign relatively more value, i.e. larger T 2 
to Task 1. The model reflects the fact that the best strategy for the system to get more 
return value is to spend more time on Task 1. 

In Figure 2-3a Task | value (V;=1) is less than Task 2 value (V2=5). In order to 
get more value the system will switch to Task 2 early. So the threshold (Opt_tau12) is less 
than 0.5 in most cases. However, the right-inner part (area A) of the figure shows that if 
Task2 is relatively much harder to complete (1.e. Az is very small) than Task1, we had 
better continue to work on the easier job instead of working on a job which we may not 
finish. In Figure 2-3b (V,;=V2=5) for most of the cases we switch around 0.5 unit of time. 
However, the left-inner part (area B) shows that if A; 1s very small the strategy to get 
more value 1s to switch early. 

Figures 2-3c and 2-3d display results for a case in which V, is much higher than 
V2. Most of the time the optimal t;2 is above 0.5 unit time. Notice that in Figure 3d area D 
is lower than it’s vicinity. From Eq-9, we know that when A, Is relatively large then the 
optimal t)2 will decrease by the amount about proportional to I/(A :+A 2). This property 
indicates that when the system is efficient enough for the first target, the system should 
reserve some time for the second target. 

Figure 2-4 displays results that when Task 2 value increases from | to 45 the 
optimal threshold value decreases. Figure 4d shows that the value of Task 2 (V2= 45) is so 


great that the system had better switch over to Task 2 as soon as possible. 
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Figure 2-3. Optimal t,2 Increases As V, Increases 
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Figure 2-4. Optimal t,. Decreases As V2 Increases 





However, when A> increases to 5 the weapon efficiency for Task 2 become high enough so 
that the system will reserve some kill time for Task 1. 
G. MODEL WITH PERFECT BDA INFORMATION 

[In this model, we assume that we have damage assessment information from a 
friendly source. The damage assessment information is knowledge that the target is killed 
when it is killed; that 1s knowledge that the task 1s completed when it is completed. The 
task values are the same as in the previous model. However, the rule of engagement is 
slightly different: the system starts immediately with Task | and pursues it for a (random) 
time min(T), T12); T; is the time to completion (acknowledgment of BDA) of Task | and 
T12 18 a threshold value. After the time, minimum (Tj, T12), the system switches to Task 2. 
We derive the expected return as follows: 


By conditioning on T, = ¢, (Ref. 3], the conditional expected return is 
V(t454,)=V, +V. (1-e ae ie, Blog 
=Vi Vile 2) ey ot Shy (11) 
= V,{1—e7*lter) i Shah. 
Now remove the condition: 
V(r.) =V,(1-e**) +¥, (1-24 1-2 ale) | 
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cf. Gaver and Jacobs (1996) (Ref. 7]. 


If 4, = 4, = 4, then (12) becomes 


le) a ma =, anf = AN er, —e - Alan se -t,)| (13) 
ie Special Case: 2, =A, =A 
Assume A, =A,= A. Note that a decision concerning the value of 7,, 1s only 


needed if T, > ¢,,. Hence, to determine the value of 7,,, we consider 


FG) = Vas 
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21s “lee No. 
If 1, 2 V,, the maximizing value of 7,, is ¢,, since the server is assumed not to 


return to Task 1. 


Assume lV’, < V,; note that f (1,,) = V,|1 = re Ta | 








Thus 
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Thus, the maximizing 7,, equals 4, 1f fF, <V,e 


Assume J, >}, >Vye Alte) Note that 








Zr, [f(z..)- A(t) =} Ae Nose) —b,Ae ‘Ase ta) (19) 
and 
2 
ss f(z.) - A(t) = Vivre Ar t1) < On (20) 
ees: 


* y 7 y Sail ay —fa : ee ® 
MIS ih ee) > Ve V2 ta) then the maximizing value of z,, Is 





‘ | Le 
T,, =ty Hog | (21) 
l 


2. Summary 
If A, = A, = A, then the maximizing value of z,, is as follows, cf. Gaver and Jacobs 


(1996) [Ref. 7]. 
™ if V, < Vie Meta) 
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3, General Case 
Assume A, = (1 + c)A for c >-1 and A, = A. Since the times to complete the tasks 
have independent exponential distributions, the lack of memory property suggests that we 


may assume /,,=0 to find the maximizing 7,,. Rewriting (12), for 0<7,<4, 


f(a) = v(1 — er) aE —e%'n jehae + a 1 eg la-e)ri | 
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Further, 
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- [f(z,2) - f(0)|=!, A(l+ cle“ 4 Vie 42 ae UO ee. | (25) 
OT 1) 
Setting 
(z,,)- £(0)] = 0 (26) 
results in the equation 
Vi (l+c)e*™ =V,e°*2 | (27) 
Solving for 7,, results in 
| V. 
: =t,,- = = ; 28 
OS ese) a 


Thus, (12) with 4, 2 0, A, = AC +c), and A, = A, the maximizing 7,, 1s 
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(29) 
cf. Gaver and Jacobs (1996) [Ref. 7]. 


Comparing (29) and (10), with the parameterization 4,=A(1 +c), 2,=A for 


c > —1, note that with no-BDA information, the switching time 7, = 7, 


Toe = ha (30) 


with 7, =7,, only ifc=0. Thus the maximum amount of time to be devoted to Task 1 
after (,, is smaller if the decision maker has no-BDA information than if he has perfect 
BDA information. The following graphs Figure 2-5 and Figure 2-6 use the same 
parameters as Figure 2-3 and Figure 2-4. The results display the property that no matter 
how the parameters change that the optimal threshold t,2 for BDA system tends to be 


greater than or equal the optimal threshold t,2 for no-BDA system. 
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Figure 2-5. The Optimal 1,2 for BDA System Is Greater ( V, increases) 
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Figure 2-6. The Optimal t,2 for BDA System Is Greater ( V2 increases) 


D. THE MEASURE OF EFFECTIVENESS (MOE) FOR BDA INFORMATION 


Selection of a Measure of Effectiveness (MOE) is perhaps the most important part 
of any analysis. One of our main objectives is to find a quantitative way to evaluate the 
value of information. We need a MOE for information that is measurable, quantifiable and 
measures to what degree the (real) objective is achieved [Ref. 4]. We may choose the time 
needed to finish a task or the value of the killed targets we achieved. In this paper we 
select the MOE of information to be the relative fractional improvement in expected return 
value from the no information case. We define the information gain to be 

Gain = ( Max (Vp) - Max (Vi7) ) / Max(Vv) , 
where Vp is the expected return value with BDA information, Vv is the expected return 
value without BDA information, and the maximum is taken with respect to the task 


switching time 712, 





It is important to determine where BDA information gives us the greatest reward. 
We also need to find the sensitive parameters. A good MOE should be able to characterize 
the situation under study. We recognize that the MOE is a function of several variables }, 
15, Ay, A2, and the overlap of available kill time windows. We need to know which 
variable is the most important, and over what ranges the variables should be studied. 

The most desirable and direct way to study the relationship between the 
parameters of the MOE function is to use an analytic formula to find out how the MOE 
function responds to each factor. Note that the size of an overlapped kill time window is 
just a relative size with respect to the weapon’s kill rate) So we may set a constant 
overlapped window length to one unit time interval. Thus, we can simplify our problem to 
four parameters: Aj, Ao, V, and V3. 

We can use the formula Gain = (Vp(t,*) - Va(t,*) )/ Va(t,*) to do three- 
dimensional plots. The optimal expected reward for BDA information Vp(t,*) 1s obtained 
by substituting the optimal threshold t,* into Equation 12. The optimal expected reward 
for no-BDA information Vn(t,*) is obtained by substituting the optimal threshold tn* 
into Equation 1. 

A three-dimensional response surface plot [Ref. 5] can numerically characterize the 
MOE function with respect to two factors. We choose weapon kill rates A; and A2 as the 
variables so that we can understand how the BDA information gain varies with respect to 
the weapon efficiency. We may also want to know how the MOE function changes with 
respect to the change of assigned task value. 

The results displayed in Figure 2-7 use the same parameters as before except for 
the task value V,;=10 and V2=5. It shows that the gain from the BDA information 1s 
relatively useful for a certain range of parameters. Note that in the region A the weapon 
efficiency for Task 1 (1) is small (near zero) and the gain from BDA information 1s also 
small. Since it is difficult to destroy the first target, the BDA is not useful which is 
intuitive. In the graph region B, the BDA information gain is higher than in region A. 


Since A, increases, we have a better chance to get to Task 2. However, in this region both 
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A, and Az are relatively large and the chance to get both task values ts high. Hence, we are 
indifferent gathering BDA information. Similar reasoning in region C suggests that if the 
chance to get the value of Task 2 1s small, both systems will work on Task |; thus we gain 
little from BDA information. In the region D, we can get the best benefit from BDA 
information which gives us a Gain of about 20 percent. 

From the above analysis, we know that if we need to make a decision whether to 
invest money to improve the weapon efficiency or to gather more information, we have to 


think carefully whether we can really benefit from our investment. 
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Figure 2-7. BDA Information Gain 
The results displayed in Figures 2-8 and 2-9 use exactly the same parameters as 


Figures 2-5 and 2-6. We notice that in Figure 2-8a the Gain in region A is higher than that 


the other regions; in this region the value of A2 is near zero and V; is relatively small. The 
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expected total task value that the systems gets 1s a small value too. Thus an improvement 
from BDA information will be significant. 

Figure 2-8d displays results for a system that assigns a very high value for Task | 
The information Gain value is below 0.1 all over the whole region. Since both systems 
allocate most of it’s kill time on Task 1, it makes little difference to have BDA information 
or not to have BDA information. 

Figures 2-8 and 2-9 also show that the MOE function is sensitive to the change of 
parameters. A proper MOE function should be able to reflect the change of the objective 
function value with respect to the change of parameters. We find that our MOE function 1s 
measurable, quantifiable, and measures the degree with which the real objective is 


achieved. 
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Figure 2-8. BDA Information Gain (V, Increases) 
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Figure 2-9. BDA Information Gain (V2 Increases) 
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tl. THE VALUE OF BDA INFORMATION IN ELECTRONIC WARFARE 


A. THE BACKGROUND 


In military operations, it has become increasingly important to use electronic 
countermeasures (ECM) and decoys to mislead an enemy’s interpretation of radar 
information. By using electronic jamming techniques the attacking side can imitate radar 
signals and present false targets and information. Moreover, the enemy can mix real 
targets with decoys; these when combined with jamming techniques, can greatly increase 
his probability of success. 

In the previous models, we have discovered that if a system pursues an optimal 
threshold policy to maximize expected reward, it should allocate the precious resource 
(kill time) to a target that has relatively high value and high kill rate. It is reasonable to 
assume that the attacker is willing to use ECM and decoys to dupe the defender into 
thinking a decoy is the more valuable target. If the attacker succeeds, the defender will 
spend less time on the real target. 

The optimal strategy for an attacker is to dupe the defender into working on 
decoys over the entire overlapped “window.” We assume the attacker has two choices: 
either to launch the decoy first, followed by the real missile; or to launch the missile first, 
followed by the decoy. Depending on the sequence chosen the battle outcomes will be 
different. If the attacker chooses to launch the decoy first, he is hoping to dupe the 
defender into putting off the switching over time for the real missile until the end of the 
overlapped window. If the attacker chooses to launch the missile first, he is hoping that 
the defender will choose to switch to the decoy as soon as possible. 

We assume that decoy and missile are detected simultaneously, the available kill 
time window for both targets are completely overlapped. Thus, the overlapped window is 
defined as w = ti2= t22 (See Figure 2-1). We also assume that the defender is unaware of 
being duped. Proceeding with the following analysis as in the previous models, if the 


decoy is launched first, we assume that the decoy is the first target presented to the 


ZS 


defender. If the decoy is launched second, then the second target presented to the 
defender is the decoy. By BDA information is meant the information that the first target 


has been killed when it is killed. 
B. VNIODELS FOR DEFENDER UNAWARE OF BEING DUPED 


7 Case I: The Attacker’s Decoy is First, Followed by the Missile 

The decoy and missile have fixed rates of being killed of Ag and A, respectively. 
The missile has a fixed value of V,, . Since the defender has been duped, he will be duped 
into assigning a duped value V, to the decoy and choosing an optimal threshold time tam 
to switch tasks. Because the actual reward value of the decoy is zero, the more value that 
the defender assigns to the decoy the more successful will be the attacker's ECM 
operation. 

By varying the decoy’s duped value V,, we can compare systems with and 
without BDA information to see how they work under different levels of ECM operation. 
The actual expected reward value of a system without BDA information can be obtained 
by rewnting Equation | with Vz = 0, to.= w. The actual expected value the defender gets 


1S: 
V6 (r4u) = Vag) ee] (31) 


where Tam 1S the duped optimal threshold for the no-BDA information system. 
If the defender has BDA information, from Equation 12 setting Vz= 0, ti2= t22 = w, and 


t21= t1= 0, the actual expected value the defender gets is: 
, = An w= tam) \ Aa Fete Ag Fm Aan W Ag tein ( Aa Am) 
V(t.) =V,] (Ie Je 4(1-e Me) oY 2 le ||(32) 
d m 


where Tam 1S the duped threshold for the BDA information system. 


Figure 3-1 displays results for Ag= 2, Am = 1 Vin = 5, w=. 
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Figure 3-1 System Duped to Think Decoy is More Valuable 


The x axis 1s the decoy’s duped value which is assigned by the defender. We vary 
the duped value /, from | to 20 to represent different levels that the defender has been 


duped. The y axis is a combined scale which includes the duped optimal ty, ( from 0 to | ) 
and the actual expected reward value with both BDA and no-BDA information system. 
In Figure 3-1, we find that when J’, is less than 4, the actual expected return value 


ofa system with BDA information is less than that a system without BDA information. 
There are two reasons why BDA information may not be advantageous: First, 
from the previous model we know that a system with BDA information tends to make its 
changeover threshold higher than a system without BDA information. This property 
causes a system which has BDA information to waste more time on the decoy. Second, 
the decoy is not likely to be destroyed within a short time interval, so the BDA 


information will not be helpful. 


vp 





The no-BDA information system has shortcomings when the system is duped into 
thinking that a decoy 1s more important than a missile. A system with no-BDA 
information, may set its threshold towards the end of the window, resulting in the no-BDA 
system returning no reward value since the system has spent all its time on a decoy. 

For a system with BDA information, even though it is duped into thinking a decoy 
is a missile it can still receive some reward value V’,, from the real target since it uses min 
(Tz, 7,,) strategy. If the decoy is destroyed, the system will immediately start to work on 
the real target. The worst case for the system with BDA information is to set its threshold 
to the end of window ( Tam = w ). In this situation, the expected return value is obtained by 


conditioning on Tz. 


V,=V 


m 


f( -e/m Nig een 
(33) 





A 
=V,J(-%)-e* 7, = Wenge 


From Equation 33 we know that the larger Aq is, the easier it 1s to kill the decoy 
and the more expected return value a system with BDA information will receive. Figure 3- 
2 displays results for the same parameters as in Figure 3-1 except that the kill rate for the 
decoy is Ag = 4. It shows the actual expected reward (V,=2.6) is higher than that of 
previous example (V,=2). 

From the above analysis the models give some insights into the value of BDA 
information in an ECM environment. The attacker should make the defender think that the 
decoy is very valuable and easily destroyed. The attacker should create a decoy that take a 
maximum time to destroy. For the defender who has BDA information, the information 
should improve his efficiency in destroying the decoy. Thus, the defender can benefit from 


BDA information. 
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Figure 3-2 System Duped to Think Decoy is More Valuable A= 4 


2. Case II: The Attacker’s Missile Is First, Followed By The Decoy 

The attacker launches the real missile first then launches the decoy. The missile and 
decoy have fixed rates of being killed of Aw = 1 and Ay = 2 respectively. The missile has a 
fixed value V,,,= 5 . If the defender is duped into thinking that the second target decoy 1s a 
missile, the system with BDA information will have a higher actual return value. Both 
models, with and without BDA information use the formula for “actual reward value “ of 


V,,(1 — eo faint | However, the duped optimal t,,~ value for a BDA information is higher 


than that of a no-BDA information system. So the actual expected reward for a system 
with BDA information is higher than that for a system without BDA information. If both 
systems are duped into thinking that a decoy is high value target, then both will receive no 


value, since they both will set the duped optimal threshold t,,7 to zero. Figure 3- shows 


oy 





both the BDA and the no-BDA system receive zero value when duped into thinking a 
decoy 1s the most important target. 
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Figure 3-3. BDA and No-BDA System Receive Zero Value 


3: Case II] Countermeasure by Maxmin Expected Reward Criteria 

Assume that the defender does not have enough information to correctly 
differentiate between a decoy and a missile. However, the defender knows that the 
attacker may take advantage by using a decoy. The defender must make a decision to 
choose an optimal threshold t which will not be affected greatly by the enemy’s ECM 
efforts. We assume that a conservative defender will use a “maxmin criteria.” This means 
the defender will choose a threshold among all possible thresholds with the “best” of the 
worst outcome values. 

Assume there is only one missile and one decoy and that they appear 


simultaneously. Assume their available kill time windows are completely overlapped. First, 
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consider a system without BDA information. It will choose a threshold t to engage the 
first target. It will receive a reward value by destroying the real target. If the system 
engages the missile first, the kill time allocated to the missile will be t . The expected 


reward value in this case will be: 
u(t )=V,,j1—e°*** | (34) 
If the system engages the decoy first, then the kill time allocated to the missile will be the 


remaining time (w-7) . The expected reward value will be: 


ole) v1 Set indi (35) 


If the no-BDA information defender uses a “maxmin criteria,” then his optimal expected 
reward value will be V,(t* ) = Max ( Min ( V(t) ) , Min ( V2 T) ) ), 
where the minimum Is taken over all switching times. 

Following similar reasoning yields for a system with BDA information an actual 


reward value of: 
Voi(r)=V,,[1-e 77" | (36) 
or 


V(t) = V (0-¢ ~Am\ Ww “Ne ae (em) Af es] 


Ay oe 
(37) 
The optimal expected reward value for BDA system will be: 
Vi(t* ) = Max ( Min ( Voi( t) ) , Min ( V2 Tt) ) )- (38) 

In Figure 3-4 we use the same parameters Am = L2 =1 ,.Vm = 5, Va = 0, w = 1. In 
Figures 3-4b to 3-4d situations the BDA information system has a decoy kill rate (Ag = L1) 
equal to 5, 3 and 0.2 unit respectively. The Figures display results that a system with BDA 
information using the maxmin criteria will tend to set its threshold greater than half of the 
available kill window provided the kill rate of a decoy is high. In this manner the BDA 


system can also get more expected value than a system without BDA information. From 


Equation 37 if the Aq is close to zero then the equation is approximately equal to Equation 


De, 





35. This means when the BDA information system uses the maxmin criteria it is at least as 
good as the no-BDA information system(Figure 3-4a). 
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Figure 3-4 System With BDA Information Using Maxmin Criteria 


4. Case [V Countermeasure by Hard Kill 

The proceeding discussion of enemy’s electronic countermeasure invokes one of 
the most important problems of operations research. It is possible for the defender to 
devote time to gathering more identification information and to use that information to 
ensure a “hard kill,” i.e. to use his weapon to shoot down any available targets. If 
electronic support measures (ESM) for identifying targets are not available, the decision 
maker must make a decision as whether to shoot both targets or only a single target. 

It is important to provide a sensitivity analysis of different weapon efficiencies for 


the decoy target. The purpose of the analysis is to provide the decision maker with a basis 
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for decision. Thus the decision maker can make his decision based on his own weapon 
efficiency. 

Assume that the defender believes his weapon can easily kill the enemy’s decoy. 
The defender wishes to use his weapon to destroy any targets and reduce the possibility 
that a missed target could deliver its warhead. Because identification information is not 
available, the defender has to subjectively assign values to targets. If the defender believes 
that the target is a decoy he will assign a low task value to that target. Let’s assume that 
the value or utility of destroying this decoy is 0.5. If the defender believes that the target 
is a missile, he will assign a task value of 5 units. For the discussion below, assume there 
are two incoming targets, one 1s the real missile and the other a decoy. 

The enemy’s tactical alternative will be defined as a random event X that has states 
of 0; and 62. Use 9; to represent the event that the enemy launches the decoy first then 
the missile and @ to represent the attacker’s opposite launching sequence. 

The defender’s tactical decision for both BDA and no-BDA systems will be to 
increase or decrease the threshold time to switch to Task 2. Since the no-BDA system will 
pursue the optimal threshold policy, if the defender believes that the decoy comes first, by 
assigning a low value to the first target the system will decrease the threshold time 
automatically. We denote the decreased optimal time threshold to be tam*; this policy will 
be denoted by d;. If the defender believes that the missile comes first, by assigning a high 
value to the first target the system will decrease the threshold time automatically. We 
denote the decreased optimal threshold time to be tma*; this policy will be denoted by d@. 
A similar threshold behavior can be applied to the BDA system. However, the decreased 
threshold time decision (d;) for BDA information system will be determined by the min(T4 
, Tam* ), this policy increased threshold time decision (d2) will be determined by the min(T» 
sTmd*) . 

The result of the defender’s decision d, ¢ D and attacker’s actions 8, € X will 


generate the payoff of R,. The payoff matrices for no-BDA and BDA information systems 
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are shown in Table | and Table 2 respectively. The payoff for both systems can be 
calculated by using the equations appearing in Appendix A. 

Figure 3-5 displays results for the attacker launch sequence: the decoy comes first 
followed by the missile and the parameters Ay = 1, Vn = 5, Vz = 0.5, tra = te2 = 1, th = to = 
0. We vary the decoy kill rate Ag from 0.1 to 15. When the kill rate for the decoy Ag is 
above 5 units, the actual reward value(R2;gp;) for the BDA information system which 
misidentified the target sequence is nearly the same as the actual reward value(R;;gp,) if 
the BDA system correctly identified the target. Since the defender believes that the missile 
comes first and reserves more of the time for first target (decoy) the BDA information 
system with a high decoy kill rate can ameliorate the effect of the misidentification. 
However, the best case for no-BDA information can only achieve | unit actual expected 


reward if it misidentifies the attack sequence. 


Attacker’s Action—> Use 9, (Decoy / Missile) Use 8+ (Missile / Decoy) 
Defender’s Action Sequence to Attack Sequence to Attack 
Use d; ( Decoy / Missile) Correctly Apply Threshold Incorrectly Apply Threshold 


Sequence to Defend Tee Ri ie Re 
Use d> ( Missile / Decoy) Incorrectly Apply Threshold Correctly Apply Threshold 
Sequence to Defend tae Ro Cad li a 


Table 1. Payoff Matrix for No-BDA System 












Use 8, (Decoy / Missile) Use 0, (Missile / Decoy) 
Defender’s Action Sequence to Attack Sequence to Attack 
Correctly Apply Threshold Incorrectly Apply Threshold 
Sequence to Defend Min( lie meeeeeeR ami Min(Tm Tam ) 2 Rr2epa 
Use d> ( Missile / Decoy) Incorrectly Apply Threshold Correctly Apply Threshold Min 
Sequence to Defend Minty) ores Min(T «tdi ).: Race 


Table 2. Payoff Matrix for BDA System 












Figure 3-6 displays results for the attacker launch sequence: the missile 
comes first followed by the decoy. The parameters are the same as those of Figure 3-5. 


We find that if the kill rate for the decoy Aq is above 10 units, the actual reward value 
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(Ri2gp1) for the BDA information system which has misidentified the target sequence is 
about the same as the actual reward value (Ri;gp,) if the system correctly identified the 
sequence. Since the defender believes that the decoy comes first and allocates less time for 
first target (decoy) the BDA information system need a higher efficiency of Ay = 10 rather 
than Ag = 5 to make up for the identification mistake. However, the best case for no-BDA 
information can only achieve | unit actual expected reward if it misidentifies the attack 
sequence. 

From the above analysis, we conclude that if a system has BDA information and a 
high weapon efficiency, it is possible for the system to counter the decoy by killing both 
targets. However, for a system with no-BDA information the chance to successfully 


counter the decoy effect is less than a system with BDA information. 


Sensitivity analysis of different kill rayes for decoy 


Actual expected reward 





Different kill rate of decoy 


Figure 3-5 Weapon Efficiency Sensitivity Analyses for Situation in Which Decoy 
Comes First 
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Figure 3-6. Weapon Efficiency Sensitivity Analyses for Situation in Which 


Missile Comes First 


IV. CONCLUSIONS AND POSSIBLE FOLLOQW-ON RESARCH 


A. CONCLUSIONS 


Recently the People’s Republic of China has been using their M-9 medium-range, 
mobile-launched ballistic missiles to intimidate my country, Taiwan, the Republic of China. 
The M-9 ballistic missile which can carry a 1,100 lb. single warhead has been launched and 
splashed barely 20 nautical miles away from Taiwan’s main seaports, Keelung and 
Kaohsiung. It is reported that bank depositors have withdrawn $370 million from the 
Taiwan banks. The country’s economic outlook has been greatly affected by the ballistic 
missile attacks. Moreover, the political confrontation is not just between Taiwan and 
mainland China but also involves the United States. 

Again this is a demonstration that the threat from theater ballistic missiles is 
growing. Such weapons can play a major political role in a regional conflict. The reasons 
for a theater missile defense project are: (1) to protect allies and troops deployed overseas 
in the theater conflicts; (2) to discourage global ballistic missile proliferation; (3) to reduce 
the chance that an ally is politically affected by the threat of a missile attack. The United 
States should devote more efforts to theater ballistic missile defense. 

However, theater missile defense requires major investment. The proper evaluation 
of weapon and information efficiency should be carefully studied. This thesis investigated 
the effects and value of battle damage assessment information availability in the defense 
against sequential missile attacks. It was found that both no-BDA and BDA information 
systems may both allocate their scarce resources, i.e. the available kill time, to achieve the 
best battle outcome, but that BDA can provide an advantage to the defender. 

For the purpose of investigating, and to quantify the value of information, we 
define a measure of effectiveness(MOE) for information. Without a proper MOE function 
to provide quantitative insights into feasibility and critical physical factors, a proper 
decision under uncertainty cannot be made. Developing a MOE function and applying it to 


the question of allocation of scarce resources in the light of the available information is not 
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only for the purpose of information evaluation, it also serves the purpose of assisting in the 
selection of a sound tactical strategy for decision making. The MOE function can provide 
insight into how the various system parameters interact and how sensitive they are to 
changes. The MOE function demonstrates that the effect of information is measurable, and 
quantifiable, and that a MOE can quantify the degree that the real objective is achieved. It 
can also help in understanding the trade-offs between weapon efficiency and the value 
information. In conclusion, the MOE function can be used for BDA information 
evaluation. 

In Chapter HI, we compare the value of BDA information and no-BDA 
information in certain situations involving electronic warfare (EW). Under different levels 
of deception, the BDA and no-BDA information will result in different rewards. A careful 
design of a system must be made to really benefit from BDA information. In the design it 
should be kept in mind that BDA is not a panacea. Depending on the enemy’s strategies 
the outcomes will be different with BDA or with no-BDA system. However, it is always 


desirable to increase the first target kill rate to benefit from the BDA information. 
B. POSSIBLE FOLLOW-ON RESEARCH 


In modern warfare, information plays an important role. In this thesis, the 
discussion is primarily about the value of BDA information. However, in an EW 
environment target identification information can play a key role. Bayesian decision 
analysis could be applied to assist in the decision as to whether to develop assets to 
acquire or to improve weapon efficiency. If there is reliable target identification 
information, a defender may be able to choose to only shoot the real target with high 
probability and thus to save limited kill time and ammunitions. This issue would involve 
two kinds of conditional probability distribution. First, is the decision probability of 
particular outcome x given a particular forecast f. Second, is the likelihood probability of 
particular forecast f given a particular outcome x [Ref. 6]. The expected reward from 
using identification information to shoot at only one target might be at least as good as 


using a maxmin criteria, provided the likelihood probability specification is accurate and 
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the data on the possible targets are adequate. However, the defender can also change his 
threshold to maximize the expected reward. An appropriate model may be formulated as a 
two-person zero sum game. 

In this thesis, it is assumed that the target acquisition time is zero. However, in real 
life, it is possible to spend significant time on such a task. A problem will be how to best 
allocate the acquisition and kill time. How does a BDA and no-BDA information system 
respond? 

Finally, suppose there are M incoming missiles of N types with each having a 
different task value. The available time to work on tasks is limited and the available 
shooters (servers) are also limited. The question will be how to maximize the expected 


reward subject to the limited resources. 
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APPENDIX 






Task 2 (Missile) 


Task 1 (Decoy 


Figure 1. Task 1 come first ,Task 2 come after. 


A. NO-BDA INFORMATION CASE 


In this section, it 1s assumed that the decoys is launched first and the missile 
second. Assume that the decoy and missile have value V, and V,, and kill rates Ag and An 


respectively. If the system can correctly identify the sequence the expected return R11 1s. 
ipaeipe alae “= Ve ta) | (Al) 
By using t,, the optimal payoff will be : 
ene V(tu» ) =V,+V_,- Ve - Ve (tata) (A2) 
If the defender is duped into thinking that the missile comes first then the decoy 


comes after, the system will switch tasks after tna* and the expected return will be: 


Ge = V(c,.,') =V,+V_- Vje"™ md 4. Vg milla “| (A3) 
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B. WITH BDA INFORMATION AFTER MIN(Tag, Tam) 


In this section it is assumed the decoy is launched first and the missile second. The 
policy is to switch to Task 2 (missile) after min(Tg ,tam). The expected reward value can be 


obtained by conditioning on Tq = tg 


(rgct,) =, 40, (1-2 to) i 
=V, +0 (1 Pred Oe re Ss). (A4) 
= V1 — go Pnlta-ton) ) lo = ie 


Removing the condition, the expected payoff Ri, will be: 


Riven, =V (0, )=V, =e = \ eV, ees eer 


11BDA 
ve (1 _ eo Fmt Fam ) pg de Tm mo (e “2 _ go ~7aTam ) (AS) 


A bad 
* a an is d — a Ag ee 
— @ Fils g Patt) 7] ga “Am Kiam —f21) 
Tn ah 


If the defender is duped into thinking Task 1(decoy) 1s Task 2(missile) then he would 


switch task after min(Ta, tTma ). We can condition on Tg then the expected payoff Ro; 


would be: 
Vic; tla en eae 
=V,+V_ (l-e ™2") oye (et Si: 
=V — ein tnd)) wer, 2, 
(A6) 
Removing the condition: 
Kipp = V(Tna ) =Vj(i—-e ra) + V,| (1 —e° “in (| — eal) 
+(l-e Amn (tx mt ) Ye Ag lm +(e Agtr — eo }4'nd ) (A7) 
Ate eee, ) A, -(A,-As tee” —!>)) 
=p d°21 p ‘m \ fon — fy ——S ple in KEmd ~'21 1 | 
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Figure 2. Task 2 comes first; Task! comes after. 


c. NO-BDA INFORMATION CASE 


In this section, it 1s assumed that the missile 1s launched first and then the decoy. 
Assume that the decoy and missile have values Vz and V,, and kill rates Ag and Am 


respectively. If the defender correctly identifies the sequence the expected return R2» is 


H(r_,)=V,{1-eh™) +V,(t-e sto 
=V,+V,- [Vem re Vj] (A8) 

ge = Aca) =V,+V_- Wena n Ye Hem (A9) 
If the defender is duped into thinking Task 2(decoy) is Task 1!(missile) then he will use 


T 4, > in this case: 


Ry =V (tay) = Vg +Vg —[Vge ne + V0 | (A10) 


4] 


D. WITH BDA INFORMATION AFTER MIN(Ta, Ta) 


In this section, it is assumed that the missile is launched first and then the decoy. 
Suppose that the defender switch to Task 2 after min(Tm , tma ). The expected reward 


value can be obtained by conditioning on Ty, = tm: 


V (ratte =U, +R Al—e AO) ae 
=V+¥,(1-e ee) yc ipeaiaien (All) 
aly i(l- oa i fy s C md < ee 


Removing the condition then the expected payoff R2 will be: 
Ry apa = F(T) = v,,(1 — eo Fmind + v,| (1 Ce It = gloss) 
+12 Me ea (eng ae (at 


— Raat  —— —|I-e ae aM tna” _ 
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If the defender is duped into thinking Task1 (missile) is Task2 (decoy) the defender would 


use min(T,,, Tp ) in this case: 


Vrs, stn) =V_+V,(1-e@ rw) Late 
=) +V (=e Ge") RET Sap Sikes 
=V,(1—e@ 4 t2- tm )) oT eal 
(A13) 
Removing the condition the expected payoff Rip will be: 
Mea, = Vice} — lee ein Tam )+ VI (l—e7**" \—e@ “4 )) 
+ (Lae Malta Fag Aa Fang (get ga tm (A14) 
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